AITopics | source dataset

Collaborating Authors

source dataset

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Fixing It in Post: A Comparative Study of LLM Post-Training Data Quality and Model Performance

Neural Information Processing SystemsJun-13-2026, 15:52:57 GMT

Recent work on large language models (LLMs) has increasingly focused on post-training and alignment with datasets curated to enhance instruction following, world knowledge, and specialized skills. However, most post-training datasets used in leading open-and closed-source LLMs remain inaccessible to the public, with limited information about their construction process. This lack of transparency has motivated the recent development of open-source post-training corpora. While training on these open alternatives can yield performance comparable to that of leading models, systematic comparisons remain challenging due to the significant computational cost of conducting them rigorously at scale, and are therefore largely absent. As a result, it remains unclear how specific samples, task types, or curation strategies influence downstream performance when assessing data quality. In this work, we conduct the first comprehensive side-by-side analysis of two prominent open post-training datasets: Tulu-3-SFT-Mix and SmolTalk. Using the Magpie framework, we annotate each sample with detailed quality metrics, including turn structure (single-turn vs. multi-turn), task category, input quality, and response quality, and we derive statistics that reveal structural and qualitative similarities and differences between the two datasets.

large language model, machine learning, natural language, (7 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.84)
Information Technology > Artificial Intelligence > Machine Learning (0.80)

Add feedback

Mixed Supervised Object Detection by Transferring Mask Prior and Semantic Similarity

Neural Information Processing SystemsApr-25-2026, 01:34:17 GMT

Object detection has achieved promising success, but requires large-scale fullyannotated data, which is time-consuming and labor-extensive. Therefore, we consider object detection with mixed supervision, which learns novel object categories using weak annotations with the help of full annotations of existing base object categories. Previous works using mixed supervision mainly learn the classagnostic objectness from fully-annotated categories, which can be transferred to upgrade the weak annotations to pseudo full annotations for novel categories. In this paper, we further transfer mask prior and semantic similarity to bridge the gap between novel categories and base categories. Specifically, the ability of using mask prior to help detect objects is learned from base categories and transferred to novel categories. Moreover, the semantic similarity between objects learned from base categories is transferred to denoise the pseudo full annotations for novel categories. Experimental results on three benchmark datasets demonstrate the effectiveness of our method over existing methods.

category, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country: Asia > China (0.14)

Genre: Research Report > Promising Solution (0.48)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

ASelf Supervised Learning Methods

Neural Information Processing SystemsApr-24-2026, 16:09:23 GMT

L.1 Source Dataset: ImageNet Table 13 and Table 14 describe 5-way 1-shot and 5-way 5-shot CD-FSL performance when ImageNet is used as the source dataset, respectively. Note that Table 14 is added for convenience and this is the same with Table 3 in the main paper.

artificial intelligence, inductive learning, machine learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)

Add feedback

A Datasheet for S

Neural Information Processing SystemsFeb-17-2026, 07:57:41 GMT

These sentence pairs will serve as a means to evaluate machines' commonsense reasoning abilities under different extra-linguistic contexts.

artificial intelligence, commonsense reasoning, natural language, (19 more...)

Neural Information Processing Systems

Country:

Oceania > Australia (0.05)
Asia > China (0.05)
North America > United States > Texas (0.04)
(6 more...)

Industry:

Law (0.68)
Education (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (0.49)

Add feedback

Chasing Fairness Under Distribution Shift: A Model Weight Perturbation Approach

Neural Information Processing SystemsFeb-17-2026, 02:21:06 GMT

Fairness in machine learning has attracted increasing attention in recent years.

artificial intelligence, distribution shift, machine learning, (15 more...)

Neural Information Processing Systems

Country:

Europe > Netherlands > North Brabant > Eindhoven (0.04)
North America > United States > Texas (0.04)
North America > United States > Michigan (0.04)
(3 more...)

Genre: Research Report > New Finding (0.68)

Industry: Law (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Data Science (0.93)

Add feedback

c9cd2d12abe92f30b1442557bdbe8f5a-Paper-Conference.pdf

Neural Information Processing SystemsFeb-17-2026, 02:21:02 GMT

data mining, distribution shift, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Europe > Netherlands > North Brabant > Eindhoven (0.04)
North America > United States > Texas (0.04)
North America > United States > Michigan (0.04)
(3 more...)

Genre: Research Report > New Finding (0.68)

Industry: Law (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Data Science > Data Mining (0.68)

Add feedback

A Closer Look at the CLS Token for Cross-Domain Few-Shot Learning

Neural Information Processing SystemsFeb-16-2026, 22:47:18 GMT

Vision Transformer (ViT) has shown great power in learning from large-scale datasets. However, collecting sufficient data for expert knowledge is always difficult. To handle this problem, Cross-Domain Few-Shot Learning (CDFSL) has been proposed to transfer the source-domain knowledge learned from sufficient data to target domains where only scarce data is available.

artificial intelligence, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Asia > China > Hubei Province (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

final_openreview_d4_corrected_footnote

Kushal Tirumala

Neural Information Processing SystemsFeb-16-2026, 10:17:23 GMT

arxiv preprint arxiv, large language model, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Vision (0.68)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

UDA

Neural Information Processing SystemsFeb-16-2026, 01:34:04 GMT

Cleaning missing values: The human-generated questions may be unanswerable. Thus, we remove the Q&A items that lack available answers. Additionally, documents lacking any valid Q&A pairs are also removed.

artificial intelligence, dataset, natural language, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.30)
Asia > China > Shanghai > Shanghai (0.06)
Asia > Singapore (0.05)

Industry: Law (0.96)

Technology: Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

Filters

Collaborating Authors

source dataset

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Fixing It in Post: A Comparative Study of LLM Post-Training Data Quality and Model Performance

f0e5cde3850e7dd0db125c0ebae16680-Supplemental-Conference.pdf

Mixed Supervised Object Detection by Transferring Mask Prior and Semantic Similarity

ASelf Supervised Learning Methods

A Datasheet for S

Chasing Fairness Under Distribution Shift: A Model Weight Perturbation Approach

c9cd2d12abe92f30b1442557bdbe8f5a-Paper-Conference.pdf

A Closer Look at the CLS Token for Cross-Domain Few-Shot Learning

final_openreview_d4_corrected_footnote

UDA